PCT 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 

G01N 33/574, 33/577, C12Q 1/68, C07K 
16/30, A61K 39/395, 38/17 



Al 



(11) International Publication Number: WO 97/46884 

(43) International Publication Date: 1 1 December 1997 (1 1.12.97) 



(21) International Application Number: 

(22) International Filing Date: 



PCT/US97/09529 
3 June 1997 (03.06.97) 



(30) Priority Data: 
08/658.639 



5 June 1996 (05.06.96) 



US 



(71) Applicant: MATRITECH, INC. [US/US]; 330 Nevada Street, 

Newton, MA 02160 (US). 

(72) Inventors: K EES EE, Susan, K.; 87 Oak Hill Road, Harvard, 

MA 01451 (US). OBAR. Robert; 19 Lewis Avenue, 
Walpole, MA 02081 (US). WU. Ying-Jye; 71 Flanagan 
Drive, Framingham, MA 01701 (US). 

(74) Agent: MEYERS, Thomas, C; Testa, Hurwitz & Thibeault, 
LLP, High Street Tower. 125 High Street, Boston, MA 
02110 (US). 



(81) Designated Suites: AU. CA, JP, European patent '(AT. BE, 
CH. DE, DK, ES, FI. FR, GB. GR. IE, IT, LU, MC, NL, 
PT, SE). 



Published 

With international search report. 



(54) Title: MATERIALS AND METHODS FOR DETECTION OF BREAST CANCER 
(57) Abstract 



Provided are materials and methods for early diagnosis of breast cancer by detection of breast cancer-associated proteins. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCT on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


I-esothn 


SI 


Slovenia 


AM 


Armenia 


Fl 


Finland 


LT 


Lithuania 


5K 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


5N 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


SZ 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnii and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Paso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Malt 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


a 


Israel 


MR 


Mauritania 


UC 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZVY 


Zimbabwe 


ci 


Cote d'l voire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






CU 


Cuba 


KZ 


Kaz&kttan 


RO 


Romania 






cz 


Ciech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Germany 


U 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


UK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







WO 97/46884 



PCT/US97/09529 



MATERIALS AND METHODS FOR DETECTION 
OF BREAST CANCER 



Field of the Invention 

The present invention relates to materials and methods for 
the detection of breast cancer, including cellular markers 
indicative of the likelihood of the presence of breast cancer. 

5 Background of the Invention 

Breast cancer is a leading cause of death in women. While 
the pathogenesis of breast cancer is unclear, transformation of 
normal breast epithelium to a malignant phenotype may be the 
result of genetic factors, especially in women under 30. Miki, 

10 et al., Science, 266: 66-71 (1994). However, it is likely that 
other, non-genetic factors also have a significant effect on the 
etiology of the disease. Regardless of its origin, breast 
cancer morbidity' increases significantly if it is not detected 

ear Ty~i"n — its progress Ton 7" "Thus,— consrderable effort-has- focused 

15 on the elucidation of early cellular events surrounding 

transformation in breast tissue. Such effort has led to the 
identification of several potential breast cancer markers. For 
example, alleles of the BRCA1 and BRCA2 genes have been linked 
to hereditary and early-onset breast cancer. Wooster, et al . , 

20 Science, 265: 2088-2090 (1994). The wild-type BRCA1 allele 
encodes a tumor supressor protein. Deletions and/or other 
alterations in that allele have been linked to transformation of 
breast epithelium. Accordingly, detection of mutated BRCA1 
alleles or their gene products has been proposed as a means for 

25 detecting breast, as well as ovarian, cancers. Miki, et al., 
supra. However, BRCA1 is limited as a cancer marker because 
BRCA1 mutations fail to account for the majority of breast 
cancers. Ford, et al . , British J. Cancer, 72: 805-812 (1995). 
Similarly, the BRCA2 gene, which has been linked to forms of 
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hereditary breast cancer, accounts for only a small portion of 
total breast cancer cases. Ford, et al., supra. 

Several other genes have been linked to breast cancer and 
may serve as markers for the disease, either directly or via 

5 their gene products. Such potential markers include the TP53 
gene and its gene product, the p53 tumor supressor protein. 
Malkin, et al., Science, 250: 1233-1238 (1990). The loss of 
heterozygosity in genes such as the ataxia telangiectasia gene 
has also been linked to a high risk of developing breast cancer. 

10 Swift, et al., N. Engl. J. Med., 325: 1831-1836 (1991). A 

problem associated with many of the markers proposed to date is 
that the oncogenic phenotype is often the result of a gene 
deletion, thus requiring detection of the absence of the wild- 
type form as a predictor of transformation. 

15 Of interest to the present invention are reports that the 

protein content of the nuclear matrix in breast epitheiia may 
provide a marker of cellular growth and gene expression in those 
cells. Khanuja, et al., Cancer Res. , 53: 3394-3398 (1993). The 
nuclear matrix forms the superstructure of the cell nucleus and 

20 comprises m ulti ple prote in compon ents that are no t fully ______ 

characterized. The nuclear matrix also provides the structural 
and functional organization of DNA. For example, the nuclear 
matrix allows DNA to form loop domains. Portions of DNA in such 
loop domains have been identified as regions comprising 

25 actively-transcribing genes. Ciejek, et al., Nature, 306: 607- 
609 (1982) . Moreover, the organization of the nuclear matrix 
appears to be 'tissue-specific and has been associated with so- 
called transformation proteins in cancer cells. Getzenberg, et 
al., Cancer Res., 51: 6514-6520 (1991); Stuurman, et al., J. 

30 Biol. Chem. , 265: 5460-5465 (1990). 

Proteins and steroid hormones thought to be involved in 
transformation are associated with the nuclear matrix in certain 
cancer cells. Getzenberg, et al., Endocrinol . Rev., 11: 399-417 
(1990) . It has been suggested that changes in the composition 

35 or organization of nuclear matrix proteins may be useful as 
markers of growth and gene expression in breast tissue. 



WO 97/46884 



PCIYUS97/09529 



-3- 

Khanuja, et al . , Cancer Res. , 53: 3394-3398 (1994). However, 
Khanuja did not identify any specific proteins for use as cancer 
markers . 

There is, therefore, a need in the art for specific, 
5 reliable markers that are differentially expressed in normal and 
transformed breast tissue and that may be useful in the 
diagnosis of breast cancer or in the prediction of its onset. 
Such markers and methods for their use are provided herein. 

Summary of the Invention 

10 The invention provides materials and methods for diagnosis 

and detection of breast cancer in tissue or in body fluid. In a 
preferred embodiment, methods according to the invention 
comprise the step of detecting in a sample of tissue or body 
fluid the presence of a protein that is not normally expressed 

15 in non-transformed (i.e., noncancerous) breast cells. Such 
proteins are typically found in the nuclear matrix fraction of 
cells or cellular material isolated according to the method of 
Fey, et al . Proc. Nat'l. Acad. Sci . (USA), 85: 121-125 (1988), 
incorporated by' reference herein. Accordingly, such proteins 
--20 — are -a-l-ternati-vely— ref er-red-to-here — 
proteins or breast cancer-associated nuclear matrix proteins. 
It is understood that, for purposes of the present invention, a 
breast cancer-associated protein, including a nuclear matrix 
protein, is one that is detectable in breast cancer cells and 

25 not detectable in non-cancerous cells and which can be isolated 
as described herein. 

In a preferred embodiment, methods of the invention 
comprise the step of detecting in a sample the presence of a 
protein or protein fragment having a molecular weight of from 

30 about 22,000 Daltons to about 81,000 Daltons and further having 
an isoelectric point of from about 5.24 to about 7.0. Also 
preferred are methods comprising the step of detecting in a 
sample the presence of a peptide comprising a continuous amino 
acid sequence selected from the group consisting of SEQ ID NO: 

35 1, SEQ ID NO: 2, SEQ ID NO: 3. SEQ ID NO: 4, SEQ ID NO: 5, SEQ 
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ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 
10, SEQ ID NO: 12, SEQ ID NO: 13, and SEQ ID NO: 14. 

Methods of the invention may be performed on any relevant 
tissue or body fluid sample. In preferred embodiments, methods 
of the invention are carried out in breast tissue and preferably 
breast biopsy tissue. However, inventive methods are also useful 
in assays for metastasized breast cancer cells in other tissue 
or body fluid samples. Methods for detecting breast cancer- 
associated proteins in breast tissue may comprise exposing such 
tissue to an antibody directed against a target breast cancer- 
associated protein. The antibody may be polyclonal or 
monoclonal and may be detectably labeled for identification of 
antibody. 

A detecting step according to the invention may comprise 
amplifying nucleic acid encoding a target breast cancer- 
associated protein using a polymerase chain reaction or a 
reverse-transcriptase polymerase chain reaction. Detection of 
products of the polymerase chain reaction may be accomplished 
using known techniques, including hybridization with nucleic 

a cid probes com plement ary to the amplified sequence. A ^ 

detecting step according to the present invention may also 
comprise using nucleic acid probes complementary to at least a 
portion of a DNA encoding a breast cancer-associated protein. 

The present invention also provides proteins and protein 
fragments that are characteristic of breast cancer cells. Such 
proteins and protein fragments are useful in the detection and 
diagnosis of breast cancer as, for example in the production of 
antibodies. The invention also provides nucleic acids encoding 
breast cancer-associated proteins. The nucleic acids themselves 
are contemplated as markers and may be detected in order to 
establish the presence of breast cancer or a predisposition 
therefor . 

Breast cancer-associated proteins in a tissue or body fluid 
sample may be detected using any assay method available in the 
art. In one embodiment, the protein may be reacted with a 
binding moiety, such as an antibody, capable of specifically 
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binding the protein being detected. Binding moieties, such as 
antibodies, may be designed using methods available in the art 
so that they interact specifically with the protein being 
detected. Optionally, a labeled binding moiety may be utilized. 

5 In such an embodiment, the sample is reacted with a labeled 

binding moiety capable of specifically binding the protein, such 
as a labeled antibody, to form a labeled complex of the binding 
moiety and the target protein being detected. Detection of the 
presence of the labeled complex then may provide an indication 

10 of the presence of a breast cancer in the individual being 
tested. 

In another embodiment, one or more breast cancer-associated 
protein (s) in a sample may be detected by isolation from the 
sample and subsequent separation by two-dimensional gel 

15 -electrophoresis to produce a characteristic two-dimensional gel 
electrophoresis pattern. The cancer cell gel electrophoresis 
pattern then may be compared with a standard pattern obtained 
from non-cancer cells. The standard may be obtained from a 
database of gel^ electrophoresis patterns. 

20 In another embodiment, oligonucleotide probes are designed 

using standard methods and are used tcTidentify DNA or m*RNA~ 
encoding breast cancer-associated protein. See, e.g., 
Maniatis et al . , "Molecular Cloning: A Laboratory Manual," Cold 
Spring Harbor Press (1989) . 

25 In another embodiment, a nucleic acid molecule may be 

isolated that comprises a sequence capable of recognizing and 
being specifically bound by a breast cancer-associated protein. 
As used herein, the term "specifically bound" refers to a 
binding affinity of greater than about 10 5 M" 1 . 

30 Nucleic acid in a sample may also be detected by, for 

example, a Southern blot analysis by reacting the sample with a 
labeled hybridization probe, wherein the probe is capable of 
hybridizing specifically with at least a portion of the target 
nucleic acid molecule. Therefore, detection of the target 

35 nucleic acid molecule in a sample can serve as an indicator of 
the presence of breast cancer in the patient being tested. A 
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nucleic acid binding protein may also be used to detect nucleic * 
acid encoding breast cancer-associated proteins. 

Numerous additional aspects and advantages of the invention 
will become apparent upon consideration of the following 
detailed description thereof. 

Description of the Drawings 

Figure 1 is a two-dimensional gel electrophoresis pattern 
produced by nuclear matrix proteins obtained from a breast 
cancer tissue sample. Arrows 1 through 8 indicate proteins that 
are expressed in breast cancer tissue but not in normal tissue. 

Figure 2 is a two-dimensional gel electrophoresis pattern 
produced by nuclear matrix proteins obtained from a normal 
breast tissue sample. 

Detailed Description of the Invention 

The present invention provides marker proteins, for 
example, nuclear matrix proteins, that are expressed in breast 
tumor cells but not in non-cancerous breast cells. The 
proteins, nucleic acids encoding them, and antibodies directed 
against them are useful in diagnostic assays and kits for early 
-detect±on~o f ~t)reast -cancer -ox- the-lirkel-i-hGod-of- onse^of- breast - 
cancer. While detection of a single breast cancer-associated 
protein is sufficient to detect breast cancer cells, diagnostic 
methods according to the invention may include detection of more 
than one marker protein in a tissue or body fluid sample. 
Materials and methods of the invention provide consistent and 
reliable means for detection of a variety of breast cancers, 
including hereditary forms and induced forms. 

Breast cancer protein markers may* be isolated, purified, 
and characterized according to well-known techniques. Proteins 
are commonly characterized by their molecular weight and 
isoelectric point. Marker proteins according to the present 
invention and for use in methods of the invention are 
characterized as being detectable by two-dimensional gel 
electrophoresis of proteins isolated from breast cancer ceils 
and not detectable by two-dimensional gel electrophoresis of 
proteins isolated from normal cells. For purposes of the 
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present invention, the term normal cells refers to cells that 
are not cancerous or pre-cancerous . 

Breast cancer-associated proteins may be isolated from a 
sample by any protein isolation method known to those skilled in 
the art, such as affinity chromatography. As used herein, 
"isolated" is understood to mean substantially free of 
undesired, contaminating proteinaceous material. For example, a 
breast cancer-associated nuclear matrix protein may be isolated 
from a cell sample using the methods for isolating nuclear 
matrix proteins disclosed in U.S. Patent No. 4,885,236 and U.S. 
Patent No. 4, 882, 268 (Such proteins are referred to therein as 
internal nuclear matrix proteins), the disclosures of which are 
incorporated by reference herein. 

In such isolation procedures, mammalian cells are generally 
extracted with an extraction solution comprising protease 
inhibitors, RNase inhibitors, and a non-ionic detergent-salt 
solution at physiological pK and ionic strength, to extract 
proteins in the nucleus and cytoskeleton that are soluble in the 
extraction solution. The target proteins then are separated 
from the cytoskeleton remaining in the extracted cells by 

"s^lubiiizing ~the"cyto"s^ 
protease inhibitors and a salt solution (such as 0.25 M (NH 4 )vS0 4 ) 
which does not dissolve the target proteins. The chromatin then 
is separated from the target proteins by digesting the insoluble 
material with DNase in a buffered solution containing protease 
inhibitors. The insoluble proteins then are dissolved in a 
solubilizing agent, such as 8 M urea plus protease inhibitors, 
and dialyzed into a physiological buffer comprising protease 
inhibitors, wherein the target proteins are soluble in the 
physiological buffer. Insoluble proteins are removed from the 
solution. 

Marker proteins in a sample of tissue or body fluid may be 
detected "in binding assays, wherein a binding partner for the 
marker protein is introduced into a sample suspected of 
containing the marker protein. In such an assay, the binding 
partner may be detectably labeled as, for example, with a 
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radioisotopic or fluorescent marker. Labeled antibodies may be 
used in a similar manner in order to isolate selected marker 
proteins. Nucleic acids encoding marker proteins may be 
detected by using nucleic acid probes having a sequence 

5 complementary to at least a portion of the sequence encoding the 
marker protein. Techniques such as PCR and, in particular, 
reverse transcriptase PCR, are useful means for isolating 
nucleic acids encoding a marker protein. The following examples 
provide details of the isolation and characterization of breast 

10 cancer-associated proteins and methods for their use in the 
detection of breast cancer. 

Example 1 

Isolation of Breast Cancer-Associated Nuclear 
Matrix Protein From Breast Cancer Tissue Samples 

15 Breast cancer-associated nuclear matrix proteins were 

identified by comparing two-dimensional gel electrophoretic 
profiles of breast cancer cells and non-cancerous breast cells 
under normal silver-staining conditions. 

Nuclear matrix proteins were isolated from breast cancer 

20 tissue using a modification of the method of Fey, et al . , Proc. 
Natl~ Acad. Sci \ (USA) ,"~~85 : 12T-T25 1 T988T, 1 n "corporate d~by 
reference herein. Fresh breast cancer tissue specimens, ranging 
in size from about 0.2 g to about 1.0 g, were obtained from ten 
infiltrating ductal carcinomas from different patients. Samples 

25 were minced into small (1 mm 3 ) pieces and homogenized with a 
Teflon pestle on ice. 

Nuclear matrix proteins from normal breast tissue were 
extracted as 50 g to 100 g samples from reduction mammoplasty 
patients. Samples were minced into small (1 mm J ) pieces and 

30 disaggregated overnight at 37° C (5% C0 2 ) in a buffered salt 
solution (Hanks Balanced Salt Solution without Ca # */Mg**) 
containing antibiotics, 10% fetal calf serum, 1 mg/mL 
collagenase A {Boehringer Mannheim), and 0.5 mg/mL dispase 
(Boehringer Mannheim) . Following disaggregation, cells were 

35 collected by centrifugation. Large aggregates were removed by 

filtration through nylon mesh (Nitex, 250 jiM) . Contaminating red 
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blood cells were lysed in a solution of buffered ammonium 
chloride (0.31 M) . The resulting cell suspension containing 
normal breast epithelial cells was washed and counted. 

Both breast tumor and normal tissue, each prepared as 
5 described above, were treated with a buffered solution 

containing 0.5% Triton X-100, vanadyl ribonucleoside complex 
(RNase inhibitor, 5' -3') plus a protease inhibitor cocktail 
(phenylmethyl sulfonyl fluoride, Sigma, St. Louis, Mo.; and 
aprotinin and leupeptin, Boehringer Mannheim) to remove lipids 

10 and soluble protein. 

Soluble cytoskeletal proteins were then removed by 
incubating the resulting pellet in an extraction buffer 
containing 250 mM (NH 4 )2S0 4 , 0.5% Triton X-100, vanadyl 
ribonucleoside complex plus a protease inhibitor cocktail for 10 

15 minutes on ice followed by centrifugation . Chromatin was 

removed by incubating the pellet in DNase I (100 micrograms per 
mL) in a buffered solution containing protease inhibitor 
cocktail for 45 minutes at 25°C. 

The remaining pellet fraction, containing nuclear matrix 

20 protein, was solubilized in a disassembly buffer containing 8 M 

___ _ — ur e a ~an d~p r o t e a s e ~ inhibit or- cock t a ri _ p 1 u s - 1% 2-mercaptoethanol ; — 
Insoluble contaminants, primarily consisting of carbohydrates 
and extracellular matrix, were removed by ul tracentrif ugation . 
Target nuclear matrix proteins remained in the supernatant. 

25 Protein concentration was determined using a Coomassie Plus 
Protein Assay Kit (Pierce Chemicals, Rockford, IL) using a 
bovine gamma globulin standard. Proteins were then precipitated 
and stored at -80 °C. 

Nuclear matrix proteins were next characterized by high- 

30 resolution two-dimensional gel electrophoresis using isoelectric 
focusing according to the procedure of O'Farrell, J. Biol. 
Chem., 250: 4007-4021 (1975), on the Investigator 2-D system 
(Millipore, Bedford, MA) . Nuclear matrix proteins were 
solubilized for isoelectric focusing analysis in a sample buffer 

35 containing 9 M urea, 65 mM 3- [ (cholamidopropyl) dimethylamino] -1- 
propanesulf ate (CHAPS), 2.2% ampholytes, and 140 mM 
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dithiothreitol (DTT) . One-dimensional isoelectric focusing was 
carried out for 18,000 volt-hours using 1 mm x 18 mm gel tubes. 
Following first dimension electrophoresis/ gels were extruded 
from gel tubes, equilibrated for 2 minutes in a buffer 
5 containing 0.3 M Tris base, 0.075 M Tris-HCl, 3.0% SDS, 50 mM 
DTT, and 0.01% bromophenol blue and placed on top of 1 mm 10% 
Tris-glycine-SDS Duracryl (Millipore) high tensile strength 
polyacrylamide electrophoresis slab gels. Second dimension slab 
gels were electrophoresed at 16 Watts per gel and 12 °C constant 
10 temperature for approximately 5 hours. Molecular weight 

standards consisted of bovine albumin (M L 66,000), ovalbumin (M t 
45,000), glyceraldehyde-3-phosphate dehydrogenase (M r 36,000), 
carbonic anhydrase (M r 29,000), bovine pancreatic trypsinogen (M r 
24,000), and soybean trypsin inhibitor (M f 20,100). Following 
15 electrophoresis, gels were fixed in a solution containing 40% 
ethanol/10% acetic acid followed by treatment with a solution 
containing 0.5% glutaraldehyde . Gels were washed extensively 
and silver stained according to the method of Rabillioud, et 
al.r Electrophoresis, 13: 429-439 (1992) and dried between 

-20 sheets of cellophane paper. 1 

Silver-stained gels were imaged using a MasterScan 
Biological Imaging System (CSP, Inc., Billerica, MA) according 
to the manufacturer's instructions. Digital filtering 
algorithms were used to remove both uniform and non-uniform 
25 background without removing critical image data. Two-D scan 

(TM) two-dimensional gel analysis and database software (version 
3.1) using multiple Gaussian least-squares fitting algorithms 
were used to compute spot patterns into optimal-fit models of 
the data as reported by Olson, et al., Anal. Biochem. , 169: 49- 
30 70 (1980) . Triangulation from the internal standards was used 
to precisely determine the molecular weight and isoelectric 
point of. each target protein of interest. Interpretive 
densitometry was performed using specific software application 
modules to integrate the data into numeric and graphical reports 
35 for each gel being analyzed. 
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Example 2 

Identification of Breast Cancer-Associated Nuclear Matrix 
Proteins Having Differential Appearance on 2-D Gels 

As described in the previous Example, 2-D gel 

5 electrophoresis patterns were obtained from samples containing 

normal breast cells and from samples containing breast cancer 

cells. Figure 1 shows a typical gel pattern produced by nuclear 

matrix proteins obtained from a normal breast tissue sample. 

Figure 2 shows a typical breast cancer-associated nuclear matrix 

10 protein pattern obtained from breast cancer tissue. Comparison 
of Figures 1 and 2 reveals that, while most proteins in the 
cancer and non-cancer samples are identical, there are eight 
proteins that are unique to the breast cancer sample (labeled in 
Figure 1). Table 1 identifies those proteins, designated BOl 

15 through BC-8, by their approximate molecular weight and 

isoelectric point. Both the molecular weight and isoelectric 
point values listed in Table 1 are approximate and accurate to 
within 1,000 Daltons for molecular weight and to within 0.2 pH 
units for isoelectric point. 

20 Table 1 

Peptide Molecular Weight Isoelectric Point Breast Cancer Normal Breast 



BC-1 80,735 5.24 + 

BC-2 32, 490 6.82 + 

BC-3 28,969 5.66 + 

BC-4 28,723 6.83 + 

BC-5 31,111 5.36 + 

BC-6 22,500 5.58 + 

BC-7 38,700 6.90 + 

BC-8 33,000 6.44 + 



Three of the breast cancer-associated nuclear matrix 
proteins that are specific to breast cancer cells were isolated 
and processed for tryptic peptide mapping and amino acid 
25 sequencing . 

Example 3 

Characterization of Breast Cancer-Associated 
Nuclear Matrix Protein Markers 
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Three of the breast cancer-associated nuclear matrix 
proteins were partially sequenced. The nuclear matrix fraction 
from a single human breast adenocarcinoma was electrophoresed on 
10% two-dimensional gels in the manner described above. 

5 Thereafter, proteins were visualized by soaking the gels in 

200mM imidazole for 10 minutes and then rinsing for 1 minute in 
water, followed by 1-2 minutes in 300mM zinc chloride. After 
protein-containing spots began to appear, the gels were placed 
in water and relevant gel spots were excised. The isolated gel 

K) spots, each representing individual breast cancer-associated 
nuclear matrix proteins, were pooled. Destaining was 
accomplished by washing for 5 minutes in 21 citric acid followed 
by several washes in 100 mM Tris hydrochloride at pH 7.0 in 
order to raise the pH within the isolated gel spots. 

15 Each set of pooled gel spots was then diluted with an equal 

volume of 2x SDS-PAGE sample buffer (250mM Tris-cl, 2% SDS, 20% 
glycerol, 0.01% bromophenol blue, 10% p-mercaptoethanol , pH 6.8) 
and incubated at 75°C for 3 minutes. Samples were then cooled on 
ice and loaded into the lanes of a 4% polyacrylamide 

_20.- __ jst a c k i n g .£LL%_p_o_l_y_a c r y 1 am ide _ se p a _r a tin g_ SDS-P_AG_E__qe 1_. , 

Electrophoresis was accomplished in Ix Tank buffer {25mM Tris- 
HC1, 192mM glycine, lft SDS, pH 8.3) to focus gel spots into 
bands. Molecular weight markers (BioRad #161-0304) were used on 
each gel to compare the observed molecular weights on one- and 

25 two-dimensional gels. 

The gels were then electroblotted onto Immobilon-PVDF 
membranes (Millipore) according to the method reported in 
Towbin, et al., Proc. Nat'l. Acod. Sci., 76: 4350-4354 (1979), 
as modified for a mini-gel format by Matsudaira, et al . , J. 

30 Biol. Chem. , 262: 10035 (1987), incorporated by reference 
herein. Membranes were then stained for 1 minute with 0.1% 
Buffalo Black (13 acetic acid, 40% methanol) and rinsed with 
water. Regions containing polypeptide bands were then excised 
with a scalpel. 

35 The resulting PVDF-bound polypeptides were then subjected 

to tryptic peptide mapping and microsequencing by the method of 
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Fernandez, et al., Analytical Biochem. , 218: 112-117 (1994), 
incorporated by reference herein, using a Hewlett-Packard Model 
1090M HPLC. Sequence determinations were made on an Applied 
Biosystems Pro Cise Sequenator. Most sequences were confirmed 
by MALDI-TOF mass spectrometry of the individual peptides. 

The results of sequencing of the BC-2, BC-6, and BC-8 
peptide fragments are provided in Table 2 below. 

Table 2 

Peptide Fragments SEQ ID NO. Predicted Observed 



Sequenced 



Mass 



Mass 



BC-6 



DLISHDEMFSDIYK 



1 



1714.55 



1712.9 



TEGNIDDSLIGGNASA 



2 



4859.22 



4859.19 



BC-2 



KAEAAASAL 



3 



KFVLMR 



4 



ANIQAVSLK 



5 
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Table 2 (Continued) 

BC-8 SDWPMTAENFR 6 1367.21 1365.5 

I I PQFMCQGGDFXNHR 7 2296.44 2293.3 

KFDDENFILR 8 1269.97 1268.4 

HWFGEVTEGLDVLR 9 1670.93 1669.9 

VIIADCGEY 10 

As shown in Table 2, two fragments of the peptide 
designated BC-6 were sequenced. Analysis in the GenBank 
5 database revealed that those sequence fragments (SEQ ID NOS: 1 
and 2) are identical to portions of the translationally- 
controlled tumor protein (TCTP) . The TCTP protein is abundantly 
transcribed under strict translational control in mouse and 
human tumor cell lines. However, its function is unknown. 

10 A large, contiguous sequence, designated BC-2 (SEQ ID NO: 

12), was obtained based upon the three smaller fragments shown 
in Table 2 (SEQ ID NOS: 3-5). A search in the GenBank database 
revealed an expressed sequence tag cDNA clone encoding an amino 
acid sequence substantially identical to that of the BC-2 

_ J5 f r a gme.nt . The. _c o_d i n g_ s equ e n c_e __i s_ s h o_wn___i n__ _S EQ_ I D_ _NO .:_ 11 Whi le— 

the expressed sequence tag corresponding to a portion of the BC- 
2 fragment does not clearly fit into any known molecular family, 
there is an homology between a segment of BC-2 and a putative 
16.7 Kda protein encoded by a gene on yeast chromosome XI. The 

20 function of the yeast protein is not known. 

Finally, an approximately 33,000 Dalton breast cancer- 
associated nuclear matrix protein having an isoelectric point of 
approximately 6.44 was sequenced from the 2D gels described 
above. That protein, designated BC-8, was partially sequenced 

25 to produce five sequence, fragments, shown in SEQ ID NOS: 6-10, 
respectively. A search in the GenBank database revealed a high 
degree of homology between each of those five sequences and 
portions 'of the amino acid sequences of several members of the 
cyclophilin superfamily. The BC-8 peptide appears to contain a 

30 typical cyclophilin domain of about 150 amino acids that is 

about 70% identical to cyclophilin A, the archetypal member of 
the cyclophilin superfamily . 



[) 



WO 97/46884 PCT/US97/09529 

- 15- 

In addition, the data indicate that there are at least two 
distinct RNA isoforms encoding BC-8. The observed amino acid 
sequences corresponding to each isoform are shown in SEQ ID NOS: 
13 and 14. 

Breast cancer-associated nuclear matrix proteins may be 
identified based upon the partial amino acid and nucleotide 
sequences provided above using well-known techniques. Thus, 
breast cancer-associated nuclear matrix proteins detected 
according to methods of the invention may be referred to as 
comprising a continuous sequence shown in the above-noted 
sequence fragments. The skilled artisan understands, for 
example, that fragments provided above are sufficient to provide 
an epitope for binding of an antibody directed against a breast 
cancer-associated nuclear matrix protein. Moreover, nucleotide 
sequences encoding the fragments described above are sufficient 
for hybridization using complementary oligonucleotide probes. 
Example 4 

Use of Differentially-Detected Markers 
to Detect Breast Cancer 

Once identified, a breast cancer-associated protein, such 
as a nuclear matrix protein, may be detected in a tissue or body 
fluid sample using numerous binding assays that are well known 
to those of ordinary skill in the art. For example, a target 
protein in a sample may be reacted with a binding moiety capable 
of specifically binding the target protein. The binding moiety 
may comprise, for example, a member of a ligand-receptor pair 
(i.e., a pair, of molecules capable of specific binding 
interactions), antibody-antigen, enzyme-substrate, nucleic acid- 
nucleic acid, protein-nucleic acid, or other specific binding 
pairs known in the art. : Binding proteins may be designed which 
have enhanced affinity for a target protein. Optionally, the 
binding moiety may be linked to a detectable label, such as an 
enzymatic, fluorescent, radioactive, phosphorescent or colored 
particle label. The labeled complex may be detected visually or 
with a spectrophotometer or other detector. 
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The proteins also may be detected using gel electrophoresis 
techniques available in the art, as disclosed, for example, in 
Maniatis et al., "Molecular Cloning: A Laboratory Manual, " Cold 
Spring Harbor Press, (1989). In two dimensional gel 
electrophoresis, proteins are first separated in a pH gradient 
gel according to their isoelectric point. This gel then is 
placed on a polyacrylamide gel and the proteins are separated 
according to molecular weight. (See, e.g., O'Farrell, J. Biol. 
Chem. 250: 4007-4021 (1975) and Example 1, supra). 

A breast cancer-associated protein or normal breast cell- 
associated protein in a sample may be detected using immunoassay 
techniques available in the art. The isolated breast cancer- 
associated protein or normal breast cell-associated proteins 
also may be used for the development of diagnostic and other 
tissue-evaluating kits and assays. 

One or more proteins associated with breast cancer may be 
detected by isolating proteins from a sample, such as a breast 
tissue cell sample from a patient, and then separating the 
nrnt-Pins by two dimensional gel ele ctrophores is to produce a 
characteristic two dimensional gel electrophoresis pattern. The 
pattern then may be compared with a standard gel pattern derived 
from normal or cancer cells processed under identical 
conditions. The standard may be stored or obtained in an 
electronic database of electrophoresis patterns. The presence 
of a breast cancer-associated protein in the two-dimensional gel 
provides an indication of the presence of breast cancer in the 
sample being tested. The detection of two or more breast 
cancer-associated proteins increases the stringency of methods 
according to the invention. 

Suitable kits for detecting breast cancer-associated 
proteins include a receptacle or other means for capturing a 
sample to be evaluated, and means for detecting the presence 
and/or quantity in the sample of one or more of the breast 
cancer-associated proteins described herein. Where the presence 
of a protein within a cell is to be detected, the kit also may 
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comprise means for disrupting the cell structure so as to expose' 
intracellular proteins. 

A sandwich immunoassay technique may be utilized to detect 
breast cancer-associated protein or protein from normal cells. 

5 In that method, two antibodies capable of binding the target 
protein are used, one immobilized onto a solid support and one 
free in solution and detectably labeled. Examples of labels 
that may be used for the second antibody include radioisotopes, 
fluorescent compounds, haptens, and enzymes or other molecules 

iu that generate colored or electrochemically active products when 
exposed to a reactant or enzyme substrate. When a sample 
containing the target protein is placed in this system, the 
target protein binds to both the immobilized antibody and the 
labeled antibody to form a "sandwich" immune complex on the 

15 support surface. The complexed protein is detected by washing 
away non-bound sample components and excess labeled antibody, 
and measuring the amount of labeled antibody complexed to 
protein on the support surface. 

The sandwich immunoassay is highly specific and very 

20 sensitive, provided that labels with good limits of detection 
are used. A detailed review of immunological assay design, 
theory and protocols can be found in numerous texts in the art, 
including Practical Immunology, Butt, W.R., ed., Marcel Dekker, 
New York, 1984. In general, immunoassay design considerations 

25 include preparation of antibodies (e.g., monoclonal or 

polyclonal) having sufficiently high binding specificity for the 
target protein to form a complex that can be distinguished 
reliably from products of nonspecific interactions. As used 
herein, ."antibody 11 is understood to include other binding 

30 proteins having appropriate binding affinity and specificity for 
the target protein. The higher the antibody binding 
specificity, the lower the target protein concentration that can 
be detected. A preferred binding specificity is such that the 
binding protein has a binding affinity for the target protein of 

35 greater than about 10 5 M" 1 , and preferably greater than about 
10 7 M" 1 . 
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Antibody binding domains also may be produced 
biosynthetically and the amino acid sequence of the binding 
domain may be manipulated to enhance binding affinity with a 
preferred epitope on the target protein. Specific antibody 
5 methodologies are well understood and described in the 

literature. A more detailed description of their preparation 
can be found/ for example, in Practical Immunology, Butt, W.R., 
ed,, Marcel Dekker, New York, 1984, incorporated by reference 
herein. Optionally, a monovalent antibody such as a Fab 
10 antibody fragment may be utilized. Additionally, genetically 
engineered biosynthetic antibody binding sites may be utilized 
which comprise either 1) non-covalently associated or disulfide 
bonded synthetic V H and V L dimers, 2) covalently linked V H -V L 
single chain binding sites, 3) individual Vpj or domains, or 

15 4) single chain antibody binding sites as disclosed, for example 
in Huston et al., U.S. Patent Nos. 5,091,513 and 5,132,405, and 
in Ladner et al . , U.S. Patent Nos. 4,704,692 and 4,946,778, the 
disclosures of which are incorporated by reference herein. 
Antibodies— to--isolated--tar-ge-t^--br.east_ cancer- associated-_or 

20 normal breast tissue-associated proteins that are useful in 
assays for detecting breast cancer in an individual may be 
generated using standard immunological procedures well known and 
described in the art. See, for example, Practical Immunology, 
Butt, N.R., ed., Marcel Dekker, NY, 1984. Briefly, an isolated 

25 target protein is used to raise antibodies in a xenogeneic host, 
such as a mouse, goat or other suitable mammal. Preferred 
antibodies are antibodies that bind specifically to an epitope 
on the protein, preferably having a binding affinity greater 
than 10 5 M" 1 , most preferably having an affinity greater than 

30 10 7 M" 1 for that epitope. 

The protein is combined with a suitable adjuvant capable of 
enhancing antibody production in the host, and injected into the 
host, for example, by intraperitoneal administration. Any 
adjuvant suitable for stimulating the host's immune response may 

35 be used to advantage. A commonly used adjuvant is Freund's 
complete adjuvant (an emulsion comprising killed and dried 
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microbial cells, e.g., from Calbiochem Corp., San Diego, or 
Gibco, Grand Island, NY) . Where multiple antigen injections are 
desired, the subsequent injections comprise the antigen in 
combination with an incomplete adjuvant (e.g., cell-free 
5 emulsion) . 

Polyclonal antibodies may be isolated from the antibody- 
producing host by extracting serum containing antibodies to the 
protein of interest. Monoclonal antibodies may be produced by 
isolating host cells that produce the desired antibody, fusing 

10 these cells with myeloma cells using standard procedures known 
in the immunology art, and screening for hybrid cells 
(hybridomas) that react specifically with the target protein and 
have the desired binding affinity. 

Provided below is an exemplary protocol for monoclonal 

15 antibody production, which is currently preferred. Other 
protocols also are envisioned. Accordingly, the particular 
method of producing antibodies to target proteins is not 
envisioned to be an aspect of the invention. 

Monoclonal, antibodies to any target protein, and especially 

20 a nuclear matrix protein associated with breast cancer may be 
readily prepared using methods available in the art, including 
those described in Kohler, et al. , Nature, 256: 495 (1975) for 
fusion of myeloma cells with spleen cells. 

The presence of breast cancer in an individual also may be 

25 determined by detecting, in a tissue or body fluid sample, a 
nucleic acid molecule encoding a breast cancer-associated 
protein. Using methods well known to those of ordinary skill in 
the art, breast cancer-associated nuclear matrix proteins may be 
sequenced, and then, based on the determined sequence, 

30 oligonucleotide probes may be designed for screening a cDNA 

library to determine the sequence of nucleic acids encoding for 
the target proteins. (See, e.g., Maniatis et al . , "Molecular 
Cloning: A Laboratory Manual," Cold Spring Harbor Press, 
(1989) ) . 

35 A target nucleic acid molecule, encoding a breast cancer- 

associated protein, may be detected using a binding moiety, 
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optionally labeled, capable of specifically binding the target 
nucleic acid. The binding moiety may comprise, for example, a 
protein or a nucleic acid. Additionally, a target nucleic acid, 
such as an mRNA encoding a breast cancer-associated nuclear 

5 matrix protein, may be detected by conducting a northern blot 
analysis using labeled oligonucleotides, (e.g., a nucleic acid 
fragments complementary to and capable of hybridizing 
specifically with at least a portion of a target nucleic acid) . 
While any length oligonucleotide may be utilized to hybridize an 

10 mRNA transcript, oligonucleotides typically within the range of 
8-100 nucleotides, preferably within the range of 15-50 
nucleotides, are envisioned to be most useful in standard RNA 
hybridization assays. 

The oligonucleotide selected for hybridizing to the target 

15 nucleic acid, whether synthesized chemically or by recombinant 
DNA techniques, is isolated and purified using standard 
techniques and then preferably labeled (e.g., with 35 S or 32 P) 
using standard labeling protocols. A sample containing the 
target nucleic a ^i d then is run on an e lec tro p hore sis gel, the 

20 dispersed nucleic acids transferred to a nitrocellulose filter 
and the labeled oligonucleotide exposed to the filter under 
suitable hybridizing conditions, e.g. 50% formamide, 5 X SSPE, 2 
X Denhardt's solution, 0.1% SDS at 42°C, as described in 
Maniatis et al., "Molecular Cloning: A Laboratory Manual," Cold' 

25 Spring Harbor Press, (1989) . Other useful procedures known in 
the art include solution hybridization, and dot and slot RNA 
hybridization'. The amount of the target nucleic acid present in 
a sample then optionally is quantitated by measuring the 
radioactivity of hybridized fragments, using standard procedures 

30 known in the art. 

Following a similar protocol, oligonucleotides also may be 
used to identify other sequences encoding members of the target 
protein families. The methodology also may be used to identify 
genetic sequences associated with the nucleic acid sequences 

35 encoding the proteins described herein, e.g., to identify non- 
coding sequences lying upstream or downstream of the protein 
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coding sequence, and which may play a functional role in 
expression of these genes. Additionally, binding assays may be 
conducted to identify and detect proteins capable of a specific 
binding interaction with a nucleic acid encoding a breast 

5 cancer-associated protein, which may be involved e.g., in gene 
regulation or gene expression of the protein. In a further 
embodiment, the assays described herein may be used to identify 
and detect nucleic acid molecules comprising a sequence capable 
of recognizing and being specifically bound by a breast cancer- 

10 associated nuclear matrix protein. 
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Example 5 

Identification and Therapeutic Use of Compounds that 
Interact With Breast Cancer-Associated Proteins 

Methods are provided to screen small molecules for those 
that inhibit the function of breast cancer-associated proteins. 
Such methods typically involve construction of a screening 
system in which breast cancer-associated proteins are linked to 
DNA binding proteins that are responsible, in part, for 
transcription initiation. 

cDNA encoding peptides or peptide fragments capable of 
interacting with breast cancer-associated proteins (BCAPs) are 
determined using a two-hybrid assay as reported in Durfee, et 
al., Genes & Develop., 7: 555-559 (1993), incorporated by 
reference herein. The two-hybrid assay is based upon detection 
of the expression of a reporter gene which is only produced when 
two fusion proteins, one comprising a DNA-binding domain and one 
comprising a transcription initiation domain, interact. 

A host cell that contains one or more reporter genes, such 

as ye^t strain Y153, repo rted i n Du rfee , Supra., is used. 

Expression of the reporter genes is regulated by the Gal4 
promoter. However, the host cell is deleted for Gal4 and its 
negative regulator, Gal80. Thus, host cells are turned off for 
expression of the -reporter gene or genes which are coupled to 
the uasg {the Gal upstream activating sequence) . 

Two sets of plasmids are then made. One contains DNA 
encoding a Gal 4 DNA-binding domain fused in frame to DNA 
encoding a breast cancer-associated protein (BCAP) . A second 
list of plasmids contains DNA encoding a Gal4 activation domain 
fused to portions of a human cDNA library constructed from human 
lymphocytes. Expression from the first plasmid results in a 
fusion protein comprising a Gal4 DNA-binding domain and a BCAP. 
Expression from the second plasmid produces a transcription 
activation protein fused to an expression product from the 
lymphocyte cDNA library. When the two plasmids are transformed 
into a gal-deficient host cell, such as the yeast Y153 cells 
described above, interaction of the Gal DNA binding domain and 
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transcription activation domain will occur only if the BCAP that 
is fused to the DNA binding domain binds to a protein expressed 
from the lymphocyte cDNA library fused to the transcription 
activating domain. The result of such a fusion is transcription 
initiation and expression of the reporter gene. A schematic 
diagram showing the aforementioned relationship is found in 
figure 3. 

Example 6 

Identification of Inhibitory Compounds 
The invention also provides means for identifying 
compounds, including small molecules, which inhibit specific 
interaction between a breast cancer-associated protein and its 
binding partner. In these methods, a host cell is transfected 
with DNA encoding a suitable DNA binding domain/breast cancer- 
associated protein hybrid and a translation activation 
domain/putative breast cancer-associated protein binding partner 
as disclosed above. 

The host cell also contains a suitable reporter gene in 



operative association with a cis-acting transcription activating 
element recognized by the transcription factor DNA binding 
domain. One particularly useful reporter gene is the 
lucif erase gene. Others include the lacZ gene, HIS3, LEU2, and 
GFP (Green Fluorescent Protein) genes. The level of reporter 
gene expressed in the system is first assayed. The host cell is 
then exposed to the candidate molecule and the level of reporter 
gene expression is detected. A reduction in reporter gene . 
expression is indicative of the candidate's ability to interfere 
with complex formation or stability with respect to the breast 
cancer-associated protein. As a control, the candidate 
molecule's ability to interfere with other, unrelated protein- 
protein complexes is also tested. Molecules capable of 
specifically interfering with a breast cancer-associated 
protein/binding partner interaction, but not other protein- 
protein interactions, are identified as candidates for 
production and further analysis. Once a potential candidate has 
been identified, its efficacy in modulating cell cycling and cell 
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replication can be assayed in a standard cell cycle model 
system. 

Candidate molecules can be produced as described herein. In 
addition, derivatives of candidate sequences can be created 
having, for example, enhanced binding affinity. 

Example 7 

Production of BCAP Binding Proteins 
DNA encoding breast cancer-associated proteins can be 
inserted, using conventional techniques well described in the 
art (see, for example, Maniatis (1989) Moiecuiar Cloning A 
Laboratory Manual), into any of a variety of expression vectors 
and transfected into an appropriate host cell to produce 
recombinant proteins, including both full length and truncated 
forms. Useful host cells include E. coli, Saccharomyces 
cerevisiae, Pichia pastoris, the insect/baculovirus cell system, 
myeloma cells, and various other mammalian cells. The full 
length forms of the proteins of this invention are preferably 
pxpressed in mammalian cel ls, as disclos ed herein. The_ 



20 



30 



nucleotide sequences also preferably include a sequence for 
targeting the translated sequence to the nucleus, using, for 
example, a sequence encoding the eight amino acid nucleus 
targeting sequence of the large T antigen, which is well 
characterized in the art. The vector can additionally include 
various sequences to promote correct expression of the 
25 recombinant protein, including transcription promoter and 

termination sequences, enhancer sequences, preferred ribosome 
binding site sequences, preferred mRNA leader sequences, 
preferred protein processing sequences, preferred signal 
sequences for protein secretion, and the like. The DNA sequence 
encoding the gene of interest can also be manipulated to remove 
potentially inhibiting sequences or to minimize unwanted 
secondary structure formation. As will be appreciated by the 
practitioner in the art, the recombinant protein can also be 
expressed as a fusion protein. 
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After translation, the protein can be purified from the 
cells themselves or recovered from the culture medium. The DNA 
can also include sequences which aid in expression and/or 
purification of the recombinant protein . The DNA can be 
expressed directly or can be expressed as part of a fusion 
protein having a readily cleavable fusion junction. An 
exemplary protocol for prokaryote expression is provided below. 
Recombinant protein is expressed in soluble form or in inclusion 
bodies, and can be purified therefrom using standard technology. 

The DNA may also be expressed in a suitable mammalian host. 
Useful hosts include fibroblast 3T3 cells, (e.g., NIH 3T3, from 
CRL 1658) COS (simian kidney ATCC, CRL-1650) or CHO (Chinese 
hamster ovary) cells (e.g., CH0-DXB11, from Lawrence Chasin, 
Proc. Nat'l. Acad. Sci. (1980) 77(7) : 4216-4222) , mink-lung 
epithelial cells (MVlLu) , human foreskin fibroblast cells, human 
glioblastoma cells, and teratocarcinoma cells. Other useful 
eukaryotic cell systems include yeast cells, the 
insect/baculovirus system or myeloma cells. 

To__express-a -breas_t_ canGe -^^ 

DNA is subcloned into an insertion site of a suitable, 
commercially available vector along with suitable 
promoter/enhancer sequences and 3 ! termination sequences. 
Useful promoter/enhancer sequence combinations include the CMV 
promoter (human cytomegalovirus (MIE) promoter) present, for 
example, on pCDM8, as well as the mammary tumor virus promoter 
(MMTV) boosted by the Rous sarcoma virus LTR enhancer sequence 
(e.g., from Clontech, Inc., Palo Alto). A useful inducible 
promoter includes, for example, A Zn z+ induceable promoter, such 
as the Zn 2+ metallothionein promoter (Wrana et al. (1992) Cell 
71:1003-1014.) Other inducible promoters are well known in the 
art and can be used with similar success. Expression also can 
be further enhanced using transactivating enhancer sequences. 
The plasmid also preferably contains an amplifiable marker, such 
as DHFR under suitable promoter control, e.g., SV40 early 
promoter (ATCC #37148) . Transf ection, cell culturing, gene 
amplification and protein expression conditions are standard 
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conditions, well known in the art, such as are described, for 
example in Ausubel et al., ed., Current Protocols in Molecular 
Biology, John Wiley & Sons, NY (1989) . Briefly, transfected 
cells are cultured in medium containing 5-10% dialyzed fetal 
5 calf serum (dFCS) , and stably transfected high expression cell 
lines obtained by amplification and subcloning and evaluated by 
standard Western and Northern blot analysis. Southern blots 
also can be used to assess the state of integrated sequences and 
the extent of their copy number amplification. 

10 The expressed protein is then purified using standard 

procedures. A currently preferred methodology uses an affinity 
column, such as a ligand affinity column or an antibody affinity 
column. The bound material is then washed, and receptor 
molecules are selectively eluted in a gradient of increasing 

15 ionic strength, changes in pH, or addition of mild detergent. 

The therapeutic efficacy of treating breast cancer with 
inhibitors of breast cancer-associated proteins according to the 
invention is measured by the amount of breast cancer-associated 
nuclear matrix protein released from breast cancer cells that 

20 are undergoing cell death. As reported in PCT publication 

WO93/05432 (US 92/9220, filed October 29, 1992), incorporated by 
reference herein, soluble nuclear matrix proteins and fragments 
thereof are released by cells upon cell death. Such soluble 
nuclear matrix proteins can be quantitated in a body fluid and 

25 used to monitor the degree or rate of cell death in a tissue. 
For example, the concentration of body fluid-soluble nuclear 
matrix protiens or fragments thereof released from cells is 
compared to standards from healthy, untreated tissue. Fluid 
samples are collected at discrete intervals during treatment and 

30 compared to the standard. Changes in the level of soluble 

breast cancer-associated nuclear matrix protein are indicative 
of the efficacy of treatment (i.e., the rate of cancer cell 
death) . Appropriate body fluids for testing include blood, 
serum, plasma, urine, semen, sputum, breast exudate. 
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Thus, breast cancer may be identified by the presence of 
breast cancer-associated nuclear matrix proteins as taught 
herein. Once identified in this way, breast cancer may be 
treated using inhibitors of the nuclear matrix proteins and the 
progress of such treatment, including dosing considerations, may 
be monitored by the release of soluble breast cancer-associated 
nuclear matrix proteins from breast cancer cells which have died 
or are dying as a result of such treatment. Similarly, 
monitoring the release of soluble nuclear matrix proteins from 
breast cancer cells is useful for monitoring the treatment of 
breast cancer by means other than those reported herein or such 
other means in combination with treatment means reported herein. 

Those skilled in the art will know, or be able to ascertain 
using no more than routine experimentation, many equivalents to 
the specific embodiments of the invention described herein. 
These and all other equivalents are intended to be encompassed 
by the following claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: MAT RI TECH, INC. 

(B) STREET: 330 NEVADA STREET 

(C) CITY: NEWTON 

(D) STATE: MASSACHUSETTS 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP): 02160 
G TELEPHONE: 1-617-928-0820 

<H) TELEFAX: 1-617-928-0821 
(I) TELEX: 

(ii) TITLE OF INVENTION : MATERIALS AND METHODS FOR DETECTION OF 
BREAST CANCER 

<iii> NUMBER OF SEQUENCES : 14 

fiv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Testa, Hurwitz & Thibeault 

(B) STREET: 125 High St. 

(C) CITY: Boston 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02110 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

C OPERATING SYSTEM: PC- DOS /MS- DOS 

Id) SOFTWARE: Patentln Release #1.0, Version 111.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

"(C) "CLASS IFICATI ON : - ~ 

(viii) ATTORNEY/ AGENT INFORMATION : 

(A) NAME: MEYERS, THOMAS C 

(B) REGISTRATION NUMBER: 36,989 

(C) REFERENCE/ DOCKET NUMBER: MTP-021PC (8395/24) 



(ixi TELECOMMUNICATION INFORMATION : 
1 (A) TELEPHONE: (617) 248-7000 

<B) TELEFAX : (617) 248-7100 

(2) INFORMATION FOR SEQ ID NO:l: 

fi) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Asp Leu lie Ser His Asp Glu Met Phe Ser Asp lie Tyr Lys 
- 1 5 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Thr Glu Gly Asn lie Asp Asp Ser Leu He Gly Gly Asn Ala Ser Ala 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Lys Ala Glu Ala Ala Ala Ser Ala Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 4: 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Lys Phe Val Leu Met Arg 

1 5 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Ala Asn He Gin Ala Val Ser Leu Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO: 6: 

" (ii SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Ser Asp Val Val Pro Met Thr Ala Glu Asn Phe Arg 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

lie lie Pro Gin Phe Met Cys Gin Gly Gly Asp Phe Xaa Asn His Arg 
15 10 IS 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 
IB) TYPE: amino acid 
(C) STRANDEDNESS: 
fD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



__(_xi-)_ SEQUENCE-DESCRI-P-TIOR: -SEQ- ID__NO.:_8_:_ 

Lys Phe Asp Asp Glu Asn Phe lie Leu Arg 
I 5 ' 10 



(2) INFORMATION FOR SEQ ID NO: 9: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

( B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

His Val Val Phe Gly Glu Val Thr Glu Gly Leu Asp Val Leu Arg 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
tC) STRANDEDNESS: 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
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Val He He Ala Asp Cys Gly Glu Tyr 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 613 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 
(B> LOCATION: 1..519 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

AGA TGG CCA AGC AAG GCC AGA TGG ATG CTG TTC GCA TCA TGG CAA AAG 4R 
Arg Trp Pro Ser Lys Ala Arg Trp Met Leu Phe Ala Ser Trp Gin Lys 
15 10 is 

ACT TGG GTT GCA CCC GGC TAT GTG CGC AAG TTT GTA TTG ATG CGG GCC Qfi 
Thr Trp Val Ala Pro Gly Tyr Val Arg Lys Phe Val Leu Met Arq Ala 
20 25 30 

AAC ATC CAG GCT GTG TCC CTC AAG ATC CAG ACA CTC AAG TCC AAC AAC 14 4 

Asn He Gin Ala Val Ser Leu Lys lie Gin Thr Leu Lys Ser Asn Asn 
35 40 45 

TCG ATG GCA CAA GCC ATG AAG GGT GTC ACC AAG GCC ATG GGC ACC ATG 192 
Ser Met Ala Gin Ala Met Lys Gly Val Thr Lys Ala Met Gly Thr Met 
50 55 60 

AAC AGA CAG CTG AAG TTG CCC CAG ATC CAG AAG ATC ATG ATG GAG TTT 24 0 

Asn Arg Gin Leu Lys Leu Pro Gin He Gin Lys He Met Met Glu Phe 
65 . ™ 75 ' 80 

~GAG~CG^CAG GCA GAG~ATC~ ATgTgAT ATG AAG GAcTgAG ATG ATG AAT GAT~ ~ ^2 88~ 
Glu Arg Gin Ala Glu lie Met Asp Met Lys Glu Glu Met Met Asn Asd 
85 90 95 K 

GCC ATT GAT GAT GCC ATG GGT GAT GAG GAA GAT GAA GAG GAG AGT GAT 3 36 

Ala He Asp Asp Ala Met Gly Asp Glu Glu Asp Glu Glu Glu Ser Asd 
100 105 no H 

GCT GTG GTG TCC CAG GTT CTG GAT GAG CTG GGA CTT AGC CTA ACA GAT 384 
Ala Val Val Ser Gin Val Leu Asp Glu Leu Gly Leu Ser Leu Thr Asp 
115 120 125 

GAG CTG TCG AAC CTC CCC TCA ACT GGG GGC TCG CTT AGT GTG GCT GCT 4 3? 

Glu Leu Ser Asn Leu Pro Ser Thr Gly Gly Ser Leu Ser Val Ala Ala 
130 135 140 

GGT GGG AAA AAA ' GCA GAG GCC GCA GCC TCA GCC CTA GCT GAT GCT GAT 4 80 

Gly Gly Lys Lys Ala Glu Ala Ala Ala Ser Ala Leu Ala Asp Ala Asp 
145 150 155 K i6u 

GCA GAC CTG GAG GAA CGG CTT AAG AAC CTG CGG AGG GAC TGAGTGCCCC 52 9 

Ala Asp Leu Glu Glu Arg Leu Lys Asn Leu Arq Arg Asp 
165 170 H 

TGCCACTCCG AGATAACCAG TGGATGCCCA GGATCTTTTA CCACAACCCC TCTGTAATAA 589 
AAGAGATTTG ' AC ACTAAAAA AAAA 6 x 3 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 173 amino acids 

( B ) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Arg Trp Pro Ser Lys Ala Arg Trp Met Leu Phe Ala Ser Trp Gin Lvs 
1 5 10 15 * 

Thr Trp Val Ala Pro Gly Tyr Val Arg Lys Phe Val Leu Met Arq Ala 
20 25 30 

Asn He Gin Ala Val Ser Leu Lys He Gin Thr Leu Lys Ser Asn Asn 
35 40 45 

Ser Met Ala Gin Ala Met Lys Gly Val Thr Lys Ala Met Gly Thr Met 
50 55 60 

Asn Arg Gin Leu Lys Leu Pro Gin He Gin Lys He Met Met Glu Phe 
65 70 75 80 

Glu Arg Gin Ala Glu He Met Asp Met Lys Glu Glu Met Met Asn Asp 
85 90 95 

Ala He Asp Asp Ala Met Gly Asp Glu Glu Asp Glu Glu Glu Ser Asd 
100 105 HO 

Ala Val Val Ser Gin Val Leu Asp Glu Leu Gly Leu Ser Leu Thr Asd 
115 120 125 

Glu Leu Ser Asn Leu Pro Ser Thr Gly Gly Ser Leu Ser Val Ala Ala 
130 135 140 

Gly Gly Lys Lys Ala Glu Ala Ala Ala Ser Ala Leu Ala Asp Ala Asp 
145 - 150 155 160 

Ala Asp Leu Glu Glu Arg Leu Lys Asn Leu Arg Arq Asp 
165 170 

(2) INFORMATION FOR SEQ ID NO: 13: 

("if" S EQU ENC E~CHARACTERI STIC S: " 

(A) LENGTH: 121 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Cys Gin Gly Gly Asp Phe Thr Asn His Asn Gly thr Gly Gly Lys Ser 
15 10 15 

He Tyr Gly' Lys Lys Phe Asp Asp Glu Asn Phe He Leu Lys His Thr 
20 25 30 

Gly Pro Gly Xaa Xaa Leu Ser Met Ala Asn Ser Gly Pro Lys His Gin 
35 40 45 

Trp Leu Ser Val Leu Pro Asp Met Leu Thr Arg Gin Thr Gly Trp Asp 
50 55 60 

Gly Gin Ala Cys Gly Val Xaa Glu Arg Phe Thr Glu Gly Leu Arg Xaa 
65 70 75 80 

Val Leu Arg Gin lie Glu Ala Gin Gly Ser Lys Asp Gly Lys Pro Lys 
85 90 95 

Gin Lys Val lie He Ala Asp Cys Gly Glu Tyr Val Leu Arg Ala Ala 
100 105 110 

Leu Ser Leu Leu Ser Pro Ser Ala Leu 
115 120 
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(2) INFORMATION FOR SEQ ID NO: 14: 

{i} SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 141 amino acids 
(5) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 



{xi} SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Leu Arg Ser Asp Val Val Pro Met Thr Ala Glu Asn Phe Arq Cys Leu 
15 10 15 

Cys Thr His Glu Lys Gly Phe Gly Phe Lys Gly Ser Ser Phe His Arq 
20 25 30 

He He Pro Gin Phe Met Cys Gin Gly Gly Asp Phe Thr Asn His Asn 
35 40 45 

Gly Thr Gly Gly Lys Ser He Tyr Gly Lys Lys Phe Asp Asp Glu Asn 
50 55 60 

Phe He Leu Lys His Thr Gly Pro Gly Xaa Xaa Leu Ser Met Ala Asn 
65 70 *75 80 

Ser Gly Pro Lys His Gin Trp Leu Ser Val Leu Pro Asp Met Leu Thr 
85 90 95 

Arg Gin Thr Gly Trp Asp Gly- Gin Ala Cys Gly Val Xaa Glu Arcr Phe 
100 105 110 

Thr Glu Gly Leu Arg Xaa Vai Leu Arg Gin He Glu Lys Gin Glu Glu 
115 120 . 125 



Ser Ala He Thr Ser Gin Pro Arg Xaa Trp Lys Leu -,Thr 
130 135 140 
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WHAT IS CLAIMED IS: 

11. A method for diagnosing breast cancer in tissue or body 

2 fluid, comprising the step of: 

3 detecting the presence of a breast cancer-associated 

4 protein in said tissue or body fluid. 

1 2. The method according to claim 1, wherein said breast 

2 cancer-associated protein is a nuclear matrix protein. 

1 3. The method according to claim 1, wherein said detecting 

2 step comprises detecting a plurality of said breast cancer- 

3 associated proteins. 

1 4. The method according to claim 1, wherein said breast 

2 cancer-associated protein has a molecular weight of from 

3 about 22,000 Daltons to about 81,000 Daltons and an 

4 isoelectric point of from about 5.24 to about 7.0. 

1 5. The method according to claim 4, wherein said breast 

2 cancer-associated protein has a molecular weight of about 

- 3 32— 500 -Da-ltons-and-an— isoe-l-eetr-i-c-po-i-nt— o-f—abo-ut— 6-8.2 

1 6. The method according to claim 5, wherein a portion of 

2 said breast cancer-associated protein comprises a continuous 

3 amino acid sequence selected from the group consisting of 

4 SEQ ID NO: 3, SEQ ID NO: 4, and SEQ ID NO: 5. 

1 7. The method according to claim 4, wherein said breast 

2 cancer-associated protein has a molecular weight of about 

3 22,500 Daltons and an isoelectric point of about 5.6. 

1 8. The method according to claim 7, wherein a portion of 

2 said breast cancer-associated protein comprises a continuous 

3 amino acid sequence selected from the group consisting of 

4 SEQ ID NO: 1, and SEQ ID NO: 2. 

1 9. The method according to claim 4, wherein said breast 

2 cancer-associated protein has a molecular weight of about 

3 33 f 000 Daltons and an isoelectric point of about 6.4. 
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1 10. The method according to claim 9, wherein a portion of 

2 said breast cancer-associated protein comprises a sequence 

3 selected from the group consisting of SEQ ID NO: 6, SEQ ID 

4 NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, and SEQ ID NO: 10. 

1 11. The method according to claim 1, wherein said breast 

2 cancer-associated protein comprises an amino acid sequence 

3 shown in SEQ ID NO: 12. 

1 12. The method according to claim 1, wherein said breast 

2 cancer-associated protein comprises an amino acid sequence 

3 shown in SEQ ID NO: 13. 

1 13. The method according to claim 1, wherein said breast 

2 cancer-associated protein comprises an amino acid sequence 

3 shown in SEQ ID NO: 14. 

1 14. The method according to claim 1, wherein said detecting 

2 step is carried out in a sample of breast tissue. 

1 15. The method according to claim 1, wherein said detecting 

2 step i s ca rried out in a sampl e o f body' fluid. 



1 16. The method according to claim 15, wherein said sample of 

2 body fluid comprises blood. 

1 17. The method according to claim 1, wherein said detecting 

2 step comprises exposing said tissue or body fluid to an 

3 antibody directed against an epitope on said breast cancer- 

4 associated protein. 

1 18. The method according to claim 17, wherein said antibody 

2 is a monoclonal antibody. 

1 19. The method according to claim 18, wherein said antibody 

2 is a polyclonal antibody. 

1 20. The method according to claim 17, wherein said antibody 

2 is detectably labeled. 

1 21. The method according to claim 20, wherein said label 

2 comprises a member of the group consisting of radioactive 
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3 labels, hapten labels, fluorescent labels, and enzymatic 

4 labels. 

1 22. The method according to claim 1, wherein said detecting 

2 step comprises amplifying nucleic acid encoding said breast 

3 cancer-associated protein in a polymerase chain reaction. 

1 23. The method according to claim 22, wherein said polymerase 

2 chain reaction is a reverse transcriptase polymerase chain 

3 reaction . 

1 24. A method for diagnosing the presence of breast cancer in 

2 a biological sample, comprising the steps of 

3 exposing said biological sample under hybridization 

4 conditions to a nucleic acid probe capable of hybridizing to 

5 a nucleic acid encoding a breast cancer-associated protein; 

6 and 

7 detecting duplex formed between said nucleic acid 

8 probe and said nucleic acid encoding said breast cancer- 

9 associated protein. 

l 25~. An ~an t rbo d y ~t h a t s p e c -i-f-i c -a-H-y— b -i-n d s— t o- -a-n— ep-i-fco p e-o-n— a- 

2 breast cancer-associated protein, said breast cancer- 

3 associated protein having a molecular weight of from about 

4 22,000 Daltons to about 81,000 Daltons and an isoelectric 

5 point of from about 5.24 to about 7.0. 

1 26. The antibody according to claim 25, wherein said antibody 

2 specifically binds to a breast cancer-associated protein 

3 having a molecular weight of about 32,500 Daltons and an 

4 isoelectric point of about 6.82. 

1 27. The antibody according to claim 26, wherein said antibody 

2 recognizes an epitope on said breast cancer-associated 

3 protein comprising a continuous sequence of amino acids 

4 selected from the group consisting of SEQ ID NO: 3, SEQ ID 

5 NO: 4, and SEQ ID NO: 5. 

1 28. The antibody according to claim 25, wherein said antibody 

2 specifically binds to a breast cancer-associated protein 



WO 97/46884 



-37- 



PCT/US97/09529 



3 having a molecular weight of about 22,500 Daltons and an 

4 isoelectric point of about 5.6. 

1 29. The antibody according to claim 28, wherein said antibody 

2 recognizes an epitope on said breast cancer-associated 

3 protein comprising a continuous sequence of amino acids 

4 selected from the group consisting of SEQ ID NO: l,and SEQ 

5 ID NO: 2. 

1 30. The antibody according to claim 25, wherein said antibody 

2 binds to a breast cancer-associated nuclear protein having a 

3 molecular weight of about 33,000 Daltons and an isoelectric 

4 point of about 6.40. 

1 31. The antibody according to claim 30, wherein said antibody 

2 recognizes an epitope of said breast cancer-associated 

3 protein comprising a continuous sequence of amino acids 

4 selected from the group consisting of SEQ ID NO: 6, SEQ ID 

5 NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, and SEQ ID NO: 10. 

1 32 . An oligonucleotide probe for detecting nucleotides 

2— — e n cod i ng — a'-b r e a s t- -c a nc e r~a s s o ci a te d— p to te-i-n— i-n — b r ea s t- 

3 tissue. 

1 33. A method for treating breast cancer, comprising the step 

2 of 

3 administering to a patient diagnosed as having breast 

4 cancer a therapeutically-ef f ective amount of an antibody 

5 according to claim 25. 

1 34. A pharmaceutical composition for treatment of breast 

2 cancer, comprising an antibody according to claim 25 in a 

3 pharmaceutically-acceptable carrier. 

1 35. A method for treating breast cancer, comprising the step 

2 of administering to a patient a pharmaceut ically-ef f ective 

3 ~ amount of a composition comprising a compound that inhibits 

4 activity of a breast cancer-associated protein. 

1 36. A pharmaceutical composition comprising a compound that 

2 inhibits activity of a breast cancer-associated protein. 
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1 37. The pharmaceutical composition according to claim 36, 

2 wherein said breast cancer-associated protein is a nuclear 

3 matrix protein. 
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